Combining Parametric and Non-parametric Methods for Predicting Consumer Choice Using Supermarket Scanner Data
نویسنده
چکیده
It is important for national chain stores to be able to customize their prices in individual stores to adapt to the neighborhood demand. A lot of stores collect scanner data which can be used to determine the price distribution over products which optimizes profits, and yet often this data resource is under-utilized. In previous research both parametric (such as linear regression) and non-parametric (such as artificial neural networks) models have been successfully used for prediction. We propose to approach this problem from a different angle and using both parametric and non-parametric methods to investigate ways of combining the lower-level models into a high-level model so as to achieve a better predictor than either of the simpler methods by itself. We will work with the data provided by a supermarket chain which would like to be able to price more strategically the various products in its local stores. The scanner data is for the category Chilled Juices and consists of store-level weekly reports of prices and quantities sold for the 14 products in the category. The data was assembled for two years from 100 individual stores of the supermarket chain. Our research will produce a method able to predict for a given category of products at a store level the consumer demand of the products from their prices. Previous research indicates that micro-marketing pricing strategies can increase gross profit margins by 4% to 10%, which after administrative and operating costs are subtracted would translate into an increase of operating profit margins by 33% to 83%. This research will provide a valuable tool for marketers concerned with predicting consumer choice. The new models will also generalize to other Machine Learning applications where both methods are currently used.
منابع مشابه
A Bayesian Mixed Logit-Probit Model for Multinomial Choice
In this paper we introduce a new flexible mixed model for multinomial discrete choice where the key individualand alternative-specific parameters of interest are allowed to follow an assumptionfree nonparametric density specification while other alternative-specific coefficients are assumed to be drawn from a multivariate normal distribution which eliminates the independence of irrelevant alter...
متن کاملRegression Modeling for Spherical Data via Non-parametric and Least Square Methods
Introduction Statistical analysis of the data on the Earth's surface was a favorite subject among many researchers. Such data can be related to animal's migration from a region to another position. Then, statistical modeling of their paths helps biological researchers to predict their movements and estimate the areas that are most likely to constitute the presence of the animals. From a geome...
متن کاملA Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models
Marketing variables included in consumer discrete choice models are often endogenous. Extant treatments using likelihood-based estimators impose parametric distributional assumptions such as normality, on the source of endogeneity. These assumptions are restrictive as misspecified distributions have an impact on parameter estimates and associated elasticities. The normality assumption for endog...
متن کاملComparison of Parametric and Non-parametric EEG Feature Extraction Methods in Detection of Pediatric Migraine without Aura
Background: Migraine headache without aura is the most common type of migraine especially among pediatric patients. It has always been a great challenge of migraine diagnosis using quantitative electroencephalography measurements through feature classification. It has been proven that different feature extraction and classification methods vary in terms of performance regarding detection and di...
متن کاملA comparison of parametric and non-parametric methods of standardized precipitation index (SPI) in drought monitoring (Case study: Gorganroud basin)
The Standardized Precipitation Index (SPI) is the most common index for drought monitoring. Although the calculation of this index is usually done by using the gamma distribution fitting of precipitation data, studies have shown that for accurate monitoring of drought, the optimal distribution of precipitation in each month should be determined. On the other hand, in non-stationary time series,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002